Bayesian Model Averaging Across Model Spaces via Compact Encoding
نویسندگان
چکیده
Bayesian Model Averaging (BMA) is well known for improving predictive accuracy by averaging inferences over all models in the model space. However, Markov chain Monte Carlo (MCMC) sampling, as the standard implementation for BMA, encounters difficulties in even relatively simple model spaces. We introduce a minimum message length (MML) coupled MCMC methodology, which not only addresses these difficulties but has additional benefits. The MML principle discretizes the model space and associates a probability mass with each region. This allows efficient sampling and jumping between model spaces of different complexity. We illustrate the methodology with a mixture component model example (clustering) and show that our approach produces more interpretable results when compared to Green’s popular reverse jump sampling across model subspaces technique. The MML principle mathematically embodies Occam’s razor since more complicated models take more information to describe. We find that BMA prediction based on sampling across multiple sub-spaces of different complexity makes much improved predictions compared to the single best (shortest) model.
منابع مشابه
Modeling Factors Affecting Tax Evasion in Iran's Economy Based on the Bayesian averaging approach
This study seeks to model tax evasion and identify how effective factors affect tax evasion in the Iranian economy. Recent models show the failure of traditional models; Models do not have enough ability to model hidden variables such as tax evasion. The present study considers this failure in identifying explanatory variables and experimental model design. To achieve this, the Bayesian averagi...
متن کاملPredicting waste generation using Bayesian model averaging
A prognosis model has been developed for solid waste generation from households in Hoi An City, a famous tourist city in Viet Nam. Waste sampling, followed by a questionnaire survey, was carried out to gather data. The Bayesian model average method was used to identify factors significantly associated with waste generation. Multivariate linear regression analysis was then applied to evaluate th...
متن کاملMethodology for Bayesian Model Averaging: An Update
The standard practice of selecting a single model from some class of models and then making inferences based on this model ignores model uncertainty. Ignoring model uncertainty can impair predictive performance and lead to overstatement of the strength of evidence via p-values that are too small. Bayesian model averaging provides a coherent approach for accounting for model uncertainty. A varie...
متن کاملMessage Length Estimators, Probabilistic Sampling and Optimal Prediction
The Rissanen (MDL) and Wallace (MML) formulations of learning by compact encoding only provide a decision criterion to choose between two or more models, they do not provide any guidance on how to search through the model space. Typically, deterministic search techniques such as the expectation maximization (EM) algorithm have been used extensively with the MML/MDL principles to find the single...
متن کاملBayesian Model Averaging for Propensity Score Analysis.
This article considers Bayesian model averaging as a means of addressing uncertainty in the selection of variables in the propensity score equation. We investigate an approximate Bayesian model averaging approach based on the model-averaged propensity score estimates produced by the R package BMA but that ignores uncertainty in the propensity score. We also provide a fully Bayesian model averag...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004